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Abstract 

In network tomography, we seek to infer the status of parameters (such as delay) for links inside a network through 
end-to-end probing between (external) boundary nodes along chosen routes. In this work, we apply concepts from 
compressed sensing to establish conditions on the routing matrix under which it is possible to estimate link delay from 
end-to-end measurements and also provide an upper-bound on the estimation error. Further, we investigate choice of 
the appropriate set of paths within a given network with defined boundary nodes, so as to minimize the number of 
probes needed for estimation. Simulation results show that the proposed algorithm will, with high probability, achieve 
forty percent reduction in overhead required compared to probing between every available pair of boundary nodes. 

Index Terms 

Network Tomography, Compressed Sensing, Graph Theory, Expander Graphs, Optimization 

I. Introduction 

Monitoring of link properties (delay, loss rates etc.) within the Internet continues to be an integral requirement 
within any network management framework, as part of monitoring it's utilization and performance. Within the 
context of an ever-expanding Internet, the need for accurate and fast monitoring schemes has escalated in recent years 
due to the increasing popularity of new resource-consuming services (such as video-conferencing, Internet telephony, 
and on-line games) that require quality-of-service (QoS) guarantees. In 1996, the term network tomography was 
coined by Vardi [1] to encompass a class of approaches that seek to infer internal link status from end-to-end 
measurements. A useful classification of network tomography methods for our purposes is as follows [2]: 

• Cooperative Internal Nodes: assumes presence of internal nodes on a route that respond to control packets. 
For example, active probing tools such as ping or traceroute, measure and report attributes of the round-trip 
path from external sender to internal node based on separate probe packets [3]. The challenges of such methods 
arise from the fact that many service providers do not own the entire network and hence do not have access 
to the desired internal nodes [4], [5]. 

• End-to-End: In networks with a defined boundary, it is assumed that access is available to all nodes at the 
edge (and not to any in the interior). A boundary node sends probes to all (or a subset) of other boundary 
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nodes to measure packet attributes on the path between network end points. Clearly, these edge-based methods 
do not require exchanging control messages with any interior nodes. The primary challenge confronting such 
end-to-end probe based link status estimation is that of identifiability, as will be discussed later [6], [7]. 
As the Internet evolves towards more decentralized, uncooperative and heterogeneous administrative (sub)domains, 
availability of cooperative interior nodes will be increasingly limited. Hence end-to-end network diagnostic tools are 
attracting increasing attention as a result. In end-to-end network tomography, probes are sent between two boundary 
nodes on pre-determined routes; typically these are the shortest paths between the nodes based on existing internet 
routing protocols. For parameters like delay, an additive linear model adequately represents the relation between 
measured path and individual link delay, and can be written as [8], [9] 

y = Rx (1) 

where x is the n x 1 (unknown) vector of individual link delay. The r x n binary matrix R is the routing matrix 
for the network graph corresponding to the paths chosen for the probes (each row of the matrix is a path) and 
y 6 W is the measured r-vector of end-to-end path delays. Although, the focus of this paper is link delay, our 
approach readily applies to any other link attributes (such as packet lost rate) which allows such a linear relation 
with end-to-end measurements. 

Link delay estimators based on Eq. (1) can be classified under: 

1) Deterministic: the delays are considered unknown but constant. Since the link delay is typically time varying, 
such approaches are suitable for periods of local 'stationarity' where such an assumption is valid. 

2) Stochastic: the link delay vector x is specified by a suitable a-priori parametric probability distribution; the 
method then estimates the unknown parameters of the model. For example, [10], [8], [9], [11] assume that 
link delay follow a Gaussian or an exponential distribution. 

There exist challenges with both modeling approaches. Stochastic models are usually more computationally 
intensive than deterministic ones [12] as they suffer from over-modeling (too many parameters for the data). 
Moreover, in many scenarios, one is typically interested in only the few links that are congested (i.e. excessive link 
delay). Deterministic models are better suited to exploit this (side) information; our method falls within this class. 

In Eq. (1), typically, the number of observations r <C n, because the number of accessible boundary nodes is 
much smaller than number of links inside the network. Thus the number of variables in Eq. (1) to be estimated 
is much larger than the number of equations [11], leading to generic non-uniqueness of solutions to Eq. (1), i.e., 
inability to uniquely determine link delay [10] from end-to-end measurements. 

A network administrator is typically interested in identifying only the (few) links with large delays (or high 
packet lost rate) at any given time; this side information suggests mechanisms to solve the under determined system 
in Eq. (1) provided that the sparsity of the desired solution can be exploited by the method. In other words, we are 
interested in solution vectors x with only a few large entries (say, up to k). We refer to such vectors as k-sparse. 
We will show that by using the concept of expander graphs and compressed sensing, fc-sparse delay vectors may 
be successfully estimated, provided some conditions on the routing matrix of a network are met. The estimates 
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obtained satisfy a desirable property, i.e. the difference between the true delay and the estimate (solution from 
Eq. (1)) goes to zero. We call such networks fc-identifiable and show that Eq. (1) can be solved using a linear 
programming (LP) techniques. 

Clearly, the choice of paths in the network used for probing directly affects the estimation accuracy but also 
constitutes an overhead. Hence, for a given estimation accuracy, one could attempt to minimize total number of 
probes used for network monitoring. We study this problem and arrive at a binary integer programming formulation. 
Given that this is an NP-hard problem, a heuristic algorithm is developed that provides good approximate results 
with reasonable complexity. 

In summary, our specific contributions are as follows: 

• We establish a novel connection between network tomography and binary compressed sensing via the notion 
of expander graphs; 

• We provide conditions on the routing matrix of a network under which it is (k =) 1 -identifiable. Moreover, 
we provide an upper-bound on estimation error in such cases. 

• Based on our result for 1-identifiability, we show how to choose the routing matrix for a given network such 
that it is 1 -identifiable, with minimum number of end-to-end transmitted probes. 

• We provide simulation evidence for delay estimation when the delay vector x in Eq. (1) is fc-sparse for k > 1. 
A preliminary version of our results appears in [13]. This submission is distinct due to the complete proofs of 

theorems, fuller discussion of network coding design trade-offs and a thorough set of simulation experiments. In 
addition in this manuscript, we propose an optimization problem which gives the minimum number of paths making 
the network 1 -identifiable. That would decrease the cost of end-to-end measurement in a network. 

As is customary, a network consisting of bidirectional links connecting transmitters, switches, and receivers can 
be modeled as an undirected graph N(V, E) where V (E) is the set of vertices (edges). A set of measurements 
y are obtained by end-to-end probing, given by Eq. (1). We provide analytical results for identifiability for the 
case of k = 1, which is sufficiently challenging (and unsolved to date). For case k > 1, we provide simulation 
results which show that for Internet-topology networks, the delay estimation method for k = 1 also typically yields 
acceptable estimation error. 

The paper is organized as follows: Section II relates the routing matrix of a network to bipartite graphs. Section 
III establishes a connection between link delay estimation and binary compressed sensing and identifies conditions 
on network routing matrix under which a given network is 1 -identifiable. In addition, we describe an algorithm for 
determining the expansion factor and error parameter of an expander graph. In Section IV, we look at minimizing 
number of probes sent subject to 1 -identifiability of the network. We evaluate our findings using simulations in 
Section V. The paper concludes with reflections on future work in Section VI. Proof of all theorems are given in 
Appendix-A. 

Notations: We use bold capitals (e.g. R) to represent matrices and bold lowercase symbols (e.g. x) for vectors. 
The z-th entry of a vector x is denoted by Xi. For the matrix R, A/"(R) denotes its Null space, and superscript 
' denotes it's transpose. A set is denoted by a calligraphic capitalized symbol, e.g. 1Z and \1Z\ is the cardinality 
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Fig. 1. A network with 4 boundary nodes, 2 intermediate nodes and 5 links 



(number of elements) in the set. For any set S C {1, 2, 3, ...n}, S c represents the complement. Also, for any vector 
x G R™, vector xs e R" has entries defined as follows: 

!Xi if i £ S 
(2) 
o.w. 

If x G R™, the Zp-norm of x is defined as follows: 



(3) 



i=l 

II. Routing Matrix and Bipartite Graph 

In this section we show that the routing matrix of any network can be represented as a bi-adjacency matrix of a 
suitably defined bipartite graph. This will help connect the problem of network identifi ability with expander graphs, 
a special subset of bipartite graphs. Throughout this manuscript, boundary nodes are depicted as solid circles while 
intermediate nodes are presented using dashed circles. The network in Figure 1 is used to illustrate the subsequent 
definitions. 

Definition 1. A bipartite graph is one whose vertices can be divided into two disjoint sets X and Y such that 
every edge connects a vertex in X to one in Y [14]. 

A bipartite graph is usually represented as a triple G(X, Y, H) where H C X x Y is a set with paired elements 
from X, Y. The vertex sets X and Y are called left and right sides of the graph, respectively. A bipartite graph 
G(X, Y, H) can be represented by it's bi-adjacency matrix A — [oy], where ay = 1 if node i 6 X is connected 
to node j E Y, and is zero otherwise, i.e., 



1 (i,j)€H 

(4) 

(i,j)tH 



Assume that a given network N(V,E) has a total of n links (i.e., n = \E\), 1Z is the (chosen) set of paths 
between the boundary nodes of the network and r = \1Z\. Let R rX n denote the routing matrix, where there exists 
an isomorphism between the set 1Z and the corresponding routing matrix R. For example, for the network in Figure 
1, suppose the following routing matrix is given: 
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Fig. 2. Bipartite graph corresponding to given routing matrix in Eq. (5) 
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which is equivalent to the following set of paths TZ: 

TZ = {hhhthhhihh, hh} (6) 

R rx „ can be thought of a bi-adjacency matrix of a bipartite graph G(X, Y, H) where X = E (set of links in the 
network) and Y = TZ (set of given paths in the network). There exists a connection between a node in X to a node 
in Y if a path in Y includes the corresponding link in X. Figure 2 presents the bipartite graph for the network in 
Figure 1 with routing matrix R in Eq. (5). 

Note that the above routing matrix, or its equivalent set of paths, is not a complete set of routes for the network 
in Figure 1 (e.g., doesn't include the path from n\ to rie which is hhh)- However, it is a fundamental premise in 
network tomography that the routing matrix is already chosen, and may not be changed. Hence initially we seek 
to investigate the question: assuming that the routing matrix is given, when is it possible to identify or estimate a 
single (k = 1) link parameter? In Section IV, we re-visit this question under the assumption that the routing matrix 
may be freely chosen. 

III. Expander Graphs and Network Identifiability 

In recent years, a new approach - Compressed Sensing - for estimating an n-dimensional (signal) vector x from 
a lower dimensional representation - has attracted much attention [15], [16], [17]. For any signal x e R™, the 
reduced dimension representation is equal to y = Ax, where m X n matrix A (m << n) is referred to as the 
measurement matrix. The main challenge in traditional compressed sensing is to construct A with the following 
desirable (and conflicting) properties: a) achieve maximum possible compression (m/n small) and yet allow b) 
accurate reconstruction of x from y when x is known to be sparse, using c) a fast decoding algorithm [18], [19], 
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[20], [21]. For example, when A is a binary matrix, it is known that m = O(klogj) suffices when x is k sparse. 
In scenarios where the routing matrix is already pre-determined, the main issue is to determine whether it is an 
appropriate measurement matrix for compressive sensing, i.e., if it satisfies objective b) above. To the best of the 
authors' knowledge, there is no existing literature regarding this problem. 

We first present a result on compressed sensing using expander graphs. Then, we establish a connection between 
k = 1 identifiability in a network N(V, E) to the 'expansion factor' of an expander graph. This makes it possible 
to use results from compressed sensing to address the link identifiability problem. 

A. Expander Graphs 

Definition 2. A bipartite graph G(X, Y, H) with left degree d (i.e. deg(v) = dVv e X ) is a (<f>, d, e) — expander, 
if for any $ C X with \3>\ < <j> the following condition holds: 



where N(<&) is set of neighbors o/$. Parameters <p and e are referred to as expansion factor and error parameter, 
respectively. 

Roughly speaking, in an expander graph, the degree of connectivity for any collection of nodes in the left hand 
side (X) expands to a sufficiently large number of nodes in the right hand side (Y) [22]. Expander graphs are well- 
studied; [23], [24], [25] show how to construct a (<fi, d, e) — expander graph. In a key result, Berinde and Indyk 
in [26], [27] show that the bi-adjacency matrix of a (2</>, d, e) — expander graph can be used as the measurement 
matrix for a (/(-sparse signal, for e < jq. The parameter e in an expander graph is a design variable which is related 
to recovery error. For our application, we use a larger value of e as it expands the space of expander graphs and 
hence increases the likelihood that it leads to an identifiable network. On the other hand, increasing e also increases 
the bound on error recovery and reduces the fidelity of the reconstruction. There is a consequent tradeoff between 
number of identifiable networks and the estimation error in link delay. In this paper, we focus on 1 -identifiable 
networks or equivalently 1 -sparse delay vectors. We show that for such networks, e = 1/4 yields acceptable tradeoff 
between number of 1 -identifiable networks and estimation error. As will be shown in Section V, the principal ideas 
behind our results for k = 1 appear to generalize to k > 1-sparse vectors. 

The next lemma characterizes the null space of bi-adjacency matrix of an expander graph and will be used to 
bound the error in the recovery of x from its compressive projection y. 

lemma 1. Let G(V\, V2,E) be a (2, d,e) -expander with e < 1/4 and A mxn be its bi-adjacency matrix. Assume 
w lies in the null space of A (i.e. Aw = 0) and let S be any singleton set of coordinates of w; i.e. S = {i}, 
i E {1, ...n}. Then 



\N(n > (l-e)d|$| 



(7) 



II w s ||i< 2e || w s . ||i 



(8) 
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Fig. 3. A toy network with 6 boundary nodes, 3 interior nodes and 8 links. Its routing matrix, given in Eq. (11), satisfies expansion property. 



Proof: See Appendix A. 

The following theorem provides an upper bound on error of recovering x from its lower dimensional projection 
Ax when A is a bi-adjacency matrix of an (2, d, e)-expander graph and e < 1/4. 

Theorem 1. Consider any two vectors x, x', such that they have the same projection under measurement matrix 
A; i.e. Ax = Ax'. Further, without loss of generality, suppose || x' ||i<|| x ||i. Let S be the set of largest (in 
magnitude) elements of x. Then 



x'-x h<f{e) ||x s = 



(9) 



where 



2(1 + 2e) 



e < 



1 



(10) 



1 - 2e ' ~ 4 
Proof: See Appendix A. 
Example: 

In this example, we show how the above result may be applied to a network like the one in Figure 3. Suppose 
we are given the following routing matrix: 



R 



Pi 
Pi 
P 3 
Pa 
P 5 
Pe 



m ~~> 7J3 

ni ri4 
n 3 ~» n 6 
77,4 ~^ n-j 
n 6 n 9 

Tly ^ rig 



h h h h h h h h 
1 1 
10 110 







110 10 







1 1 1 
1 1 1 
1 1 
We show that routing matrix in Eq. (11) is a) the bi-adjacency matrix of the bipartite graph presented in Figure 4 
and b) this bipartite graph is a (2, 2, l/4)-expander graph. For instance, let $ = In that case, N({1\, l 2 }), 



(11) 
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Fig. 4. Bipartite graph corresponding to routing matrix in Eq. (11) 



the set of nodes connected to {I1J2} is: 



N({l u l 2 }) = {P 1 ,P 2 ,P 3 } (12) 

which means \N({h, fo})\ = 3 and Eq. (7) holds with equality when e = 1/4. 

The null space of matrix in Eq. (11) of dimension two, is spanned by the two following vectors: 



wi = [0.36 - 0.36 - 0.23 - 0.14 0.59 - 0.23 0.36 - 0.36]* (13) 
w 2 = [-0.18 0.18 -0.45 0.63 0.26 -0.45 -0.18 - 0.18]* 
Since e = 1/4, by Lemma 1, for any w such that Rw = 0, we have: 

II w s ||i< 0.5 || w S " ||i (14) 

To demonstrate this, let S = {1}, and w = wi + 0.3w 2 . Clearly w is in null space of R; i.e. w e 7V(R). Then 
w, wj and wjc are given by: 

w = [0.31 - 0.31 - 0.36 0.05 0.67 - 0.36 0.31 - 0.31]* (15) 
w s = [0.31 0]* 

ws. = [0 - 0.31 - 0.36 0.05 0.67 - 0.36 0.31 - 0.31]* 

Thus, || w s || i= 0.31 and || wgc ||i= 2.36 and inequality (14) is satisfied. 
Let the true delay vector be: 



x=[0.1 0.1 0.1 0.1 1 0.1 0.1 0.1]* (16) 
Then the measured vector using the measurement matrix in Eq. (11) is 

y = Rx=[0.2 0.3 1.2 0.3 1.2 0.2]* (17) 
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Since R[wi w 2 ] = [0 0], any vector of the form 

x' = x + awi + /3w 2 (18) 

results in the same y. Note that a and (3 must be selected such that all entries of x' are positive. For example, let 
a = /3 = 0.1. Then 

x' = [0.12 0.08 0.03 0.15 1.08 0.03 0.12 0.08]* (19) 
Since e = 1/4, we have /(e) = 1.5. Let S = {5} be the largest element in x, then from Theorem 1 we have: 



x'-x|| 1= 0.34 < /( e ) || x s . ||i 

= 1.5 * (0.1 + 0.1 + 0.1 + 0.1 + 0.1 + 0.1 + 0.1) 
= 1.05 



(20) 



B. Network Identifiability 

We use the result of previous sections to show that under some conditions, the link delay in a network may be 
estimated as the unique solution to Eq. (1). The following theorem relates the problem of delay estimation in a 
network N(V,E) to results on expander graphs. In addition, it proves that Eq. (1) can be solved for x using a 
linear program (LP). 

Theorem 2. Let N(V, E) be a network with set of paths 1Z and corresponding routing matrix R. Suppose 
G(E, 1Z, H) is a bipartite graph with bi-adjacency matrix R. Assume x is the true delay vector of N(V, E). 
Let x* be a solution to the following LP optimization: 

min || x ||i (21) 
s.t. 

Rx* = Rx 

Then 

||x-x* ||i</(e) ||x S c ||i (22) 

if G is a (2, d, e)-expander with e < j. 
Proof: See Appendix A. 

First note that if the true delay vector x is 1-sparse, it implies || xjc = which means Zi-norm minimization 
in Eq. (21) can recover x with zero estimation error. However, if the true delay vector contains links with small 
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but non-zero delays (the more likely scenario), the estimation error is not zero and the above theorem yields an 
upper bound. 

It should be noted that in general, the reverse of Theorem 2 is not true; i.e. there exist some networks N(V, E) 
which are 1-identifiable but their corresponding bipartite graph is not an expander graph. An example of such a 
network is depicted in Figure 5-(a) with the following routing matrix: 
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The routing matrix above is a bi-adjacency matrix of the bipartite graph presented in Figure 5-(b). This bipartite 
graph is not regular in its left side since the degree of a node in left set is either one or two, and hence it cannot be 
an expander. However Figs. 6-(a) and (b), respectively, represent subgraphs of G with regular left degree one and 
two; each of these subgraphs are expander graphs. The above observation motivates us that the result in Theorem 2 
may be extendable for networks whose corresponding bipartite graph is not regular (and therefore is not expander) 
but can be partitioned into sub-graphs that are expander graphs. 
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Theorem 3. Let N(V, E) be a network with routing matrix R. Let G(X, Y, H) be a bipartite graph with bi- 
adjacency matrix R. Suppose d(Xi, Y, Hi), i = 1, 2, ...M be di-regular bipartite subgraphs of G such that: 

. X = \JX, 

. H = UHi 

. di^ dj for i ± j 

Then, N(V,E) is \-identifiable, if each of Gi is a {2 1 d i ,e)-expander graph for e < j. Further, link delay is the 
solution to LP optimization in Eq. (21). 

Proof: See Appendix A. 

It should be noted that the condition di ^ dj implies that JQ's corresponding to different GVs are disjoint. For 
future reference, we refer to the conditions in Theorem 3 as 1 -identifiability expansion conditions and if network 
N(V,E) satisfy those conditions we refer to it as 1-identifiable expander network. 

Note that the 1-identifiable expansion conditions in Definition 3 implies the following for any link pair k and If 

• They belong to different GVs and hence have different degrees di ^ dj\ 

• They belong to the same subgraph Gi, i.e. di = dj. In that case, since Gi is a bipartite graph, they satisfy the 
expansion property in Eq. (7). 

We state this observation formally in the following corollary. 

Corollary 4. Let N(V,E) be a network with routing matrix R and set of paths 1Z. Let G(E,1Z, H) be its 
corresponding bipartite graph with bi-adjacency matrix R. Then one and only one of the following statement is 
true for any two links U and lj in E, i ^ j: 

• deg(U) > deg(lj) 

• deg(k) < deg(lj) 

• deg(U) + deg(lj) — 4deg(k, lj) > 

where deg(k, lj) is defined as number of nodes connected to both k and lj in the bipartite graph G(E, 1Z, H) 1 . 
proof: See Appendix A. 

Theorem 2 (3) requires the graph G (d) to be a (2, d, e)-expander. To the best of our knowledge, there is no 
existing algorithm in the literature to determine the expansion factor or error parameter of a given bipartite graph. 
In next section, we introduce a heuristic algorithm to determine if a given bipartite graph is a (2, d, e)-expander 
and its error parameter e. 

Finally, for completeness, we state the following definition for fc-identifiable expander network. 

Definition 3. A k-identifiable expander N{V, E) is a network whose routing matrix R is the bi-adjacency matrix 
of a bipartite graph G(X,Y, H) consisting of di-regular subgraphs G(Xi,Y, Hi) with the following properties 

'it also can be stated as number of paths going through both links U and lj in the network N(V, E) with routing matrix R. 
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. X = \JX 1 

. H = UHi 

. di ^ d 3 for i / j 

• G(Xj, Y, Hi) is a (2k, di, e)-expander with e < j 

C. How to determine an Expander Graph 

In this section, we show how to determine if a given bipartite graph is a (2, d, e)-expander graph and it's error 
factor e. Suppose G(V, E) is a graph with adjacency matrix T. A walk from node v to node u is a sequence of 
edges and nodes from v to u [14]. For example, a walk from node ni to n A is n 1 / 1 n 2 ^4n4 in Figure 7. The length 
of a walk is defined as the number of links in the walk. For example, in Figure 7 there are two walks with length 
one and two from n\ to 713, nxl^nj, and ^1/1712^3^3, respectively. 

The following theorem gives an algorithm to compute number of walks with length k between two nodes in 
G(V,E) [28], [29]. 

Theorem 5. Let G(V, E) be a graph with set of vertices V — {v\, V2, vn} and set of edges E, and let T denote 
the adjacency matrix of G. Then the (i,j)th entry ofT k is the number of different walks in G of length k from Vi 

tO Vj. 

For example, the graph in Figure 7 has the following adjacency matrix: 

110 
10 11 

T = (24) 
110 

10 

Hence, T 2 and T 3 shows the number of walks between each pair of nodes with length two and three, respectively. 
Since 
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(25) 



T = 



2 4 3 1 
4 2 4 3 

3 4 2 1 
13 10 

there is only one walk with length three from ri\ to 114 (ni^hhn^). 

Now suppose G(X, Y, H) is a bipartite graph with bi-adjacency matrix A and adjacency matrix T. The following 
relation between the bi-adjacency and adjacency matrix of a bipartite graph is given as follows: 

A 

A* 

Now consider two nodes u, v 6 X. According to Theorem 5, the (u, v)-th entry in T 2 shows the number of 
walks with length two from u to v. Since G is a bipartite graph, there is no connection between nodes in X. Thus, 
all paths from node u to node v must go through a node in Y that is connected to both u and v. Therefore, the 
number of walks with length two between u and v reveals the number of their common neighbors. This observation 
proves the following lemma: 

lemma 2. Let G(X, Y, H) be a bipartite graph with biadjacency matrix A. For any two nodes in X, the corre- 
sponding entry in A* A represents the number of neighbors in Y they have in common. 



(26) 



For the bipartite graph in Figure 2, we have 

r 2 1 1 1 
12 10 1 

A ' A = 112 11 (27) 
10 12 1 
1112 

which means that any two nodes in the left hand side of the bipartite graph in Figure 2 have at most one node 
in common in the right hand side. Note that the diagonal entries of A* A represents the (common) degree of nodes 
in X. Since the graph is left 2-regular, it is easy to see that expansion is satisfied with e = 1/4. This leads to the 
following: 

Theorem 6. Let G(X, Y, H) be a bipartite graph with bi-adjacency matrix A and left regularity parameter d. Let 
A be the maximum non-diagonal entry of matrix A* A. Then G is a (2, d, e)-expander where 

A 



e = 



2d 



(28) 



Up to this point, our goal was to determine if a network with a given routing matrix is identifiable or not. In 
practice, the additional probes required along these specified routes incur a cost in terms of overhead. In the next 
section, we investigate the problem of determining the minimum number of paths required for probing that are 
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Fig. 8. Paths between boundary nodes of network in Figure 1 



sufficient for network identifiability. Minimizing the number of paths used reduces number of probes injected into 
the network. 

IV. Minimum path set selection 

As an example, consider the network in Figure 1 which has 6 paths between boundary nodes as depicted in 
Figure 8; these are collected in the following set. 

TZ = {hhhjhhhthhhjhhhJihjUh} (29) 

However, only four of them are sufficient for delay identifiability of any single link. In general, not all paths 
between boundary nodes are necessary for identifiability. The next subsection shows how to select the set of paths 
from 1Z such that a desired objective is achieved. 

A. Network Covering 

Consider a network N(V, E) with a collection of end-to-end paths 1Z. We seek the minimum number of paths 
between boundary nodes such that each link belongs to at least one path. This suffices to determine whether the 
network contains any bad link. This problem - known as 'network covering' - has been studied in literature [30], 
[5]. We provide a new algorithmic formulation in Section IV-B that finds the minimum number of paths needed 
for a given 1 -identifiable network. 

To formulate an optimization problem, define an indicator variable 1^ indicating whether path IZi E 1Z is 
included or not: 

1 IZi is used 



In, = { 

We seek to minimize the number of paths; i.e., 



(30) 

o.w. 



\n\ 

min^/ Kt . (31) 

i=l 



over all binary variables 1^ subject to the fact that each link belongs to at least one of paths. Let I l n = [1-R.ili 
be the vector of path indicators. The i-th entry of R'l^ is the number of paths that traverse the i-th link, each of 
which should be equal to or greater than 1 . We therefore seek to 
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\n\ 

min ^R, (32) 

i=l 

s.t. 

R*I TC > 1 

^ G {0, 1} 

The above is a binary integer programming which is well-known to be NP-hard; there exist a number of algorithms 
which provide good approximate solutions [31], [32] when the original problem can be 'relaxed'. 

B. Minimum Path Set for 1 -identifiability 

Among all available paths in the network, 1Z, what is the subset with minimum cardinality which guarantees 
1 -identifiability of the network? To achieve a network that is k = 1 identifiable, it is necessary that our minimum set 
'cover' the network which means R*Ik > 1, as discussed in previous section. For identifiability, we use Corollary 
4 that specifies conditions for any two links ij, lj <E E to have the network 1 -identifiable. Using the path indicator 
functions in Eq. (30), these conditions can be rewritten as below: 

• E E w 

• E E 

• E 7 ^ + E 7 ^- 4 E 7 ^° 

n k :hen k n k :ijen k ■R. k :i i eR. k ,i j en k 

As mentioned, for any two links U and Zj, one and only one of the above inequalities must be satisfied. Hence 
we use the notion of alternative constraints - a well-known device in linear binary programming [32]. Towards that 
end, we introduce three binary variables Uiij,y2ij,y3ij for any two links k and lj, i ^ j: 



Vkij = ' 



1 if the fc-th constraint is satisfied 

otherwise 

Then we rewrite the constraints in Eq. (33) in the following format: 



(34) 



n(l-y Uj )+ E E < 35 > 

"(i - my) + E In « - E In * - 1 

■R k :i 3 en k K k -.ken k 

i(i - yaw) + E 7 ^ + E 7 ^ - 4 E > o 

n k :hen k n k -.ijen k n k -.u en k ,ij en k 

yuj + y2ij + ynj = i 
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The last equality guarantees that exactly one of ykij's equals one and the others are zero. Note that if ykij = 0, 
the fe-th constraint is trivially satisfied. Therefore the constraints in Eq. (36) are equivalent to the statement that 
one and only one of in Eq. (33) holds. The above implies that if a subset of 1Z satisfies the constraints in Eq. (36), 
the network is 1 -identifiable using those paths. 

The following theorem summarized all of above findings for minimum number of paths which quarantines 1- 
identifiability of a given network N(V, E). 

Theorem 7. Suppose a network N(V, E) with routing matrix R and equivalent set of paths 1Z is given. Determining 
the minimum number of paths r which guarantees 1 -identifiability of the network can be cast as follows: 

min ^2 ^ ( 36 ) 

i=l 

s.t. 

R'Ik > 

n(l - yiij) + In «~ In * - 1 

n(l - y 2 ij) + In «- !n k >l 

n(l - y 3ij ) + £ - 4 ^ In k > 

yiij + ynj + 2/3ij = i 
y kij e {0,1} k = 1,2,3 

i Ui e {o, 1} v« 

where n is number of links inside the network (n = \E\). 

The number of constraints for the binary optimization problem in Eq. (36) is of order 0(n 2 ). However, if Eq. 
(36) doesn't have a feasible solution, it is possible that N(V,E) is still 1 -identifiable. 

C. Heuristic Algorithm 

Since the optimization problem in Eq. (32) and (36) is NP-complete, we introduce a heuristic algorithm, based on 
relaxing the binary constraints in (36) leading to a linear programming problem which is solvable in polynomial time. 
This is achieved by substituting the constraints 1^ 6 {0, 1} and y^j G {0, 1} with 1^ 6 [0, 1] and y^j <E [0, 1], 
respectively. 
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\n\ 

min ^2 In, (37) 

i=l 

s.t. 

R*I TC > 

n{l - yuj) + J! 7 ^ ~ E In * - 1 

"(1 - 2/2y) + X 7tc * ~ E 7 ^ - 1 

K k :l 3 e1l k K k :k£K k 

n(i-m)+ E 7 ^ + E 7 ^- 4 E 7 ^° 

n k :hen k n k :ijen k n k :ii£n k ,ij £iz k 

yuj + y2ij + y3ij = 1 

Vkij G [0,1] fe = l,2,3 
G [0, 1] V« 

Let 1^ be the solution to above LP optimization and I n t the maximum entry in I n , i.e., 

=argmax/^. (38) 

We use 7^i» as one of the paths for probing or equivalently set I Ut = 1 . Now we solve the LP in Eq. (37) with 
the additional constraint I K t = 1. If we run the above algorithm \1Z\ times, we achieve an approximate solution 
to the ILP problem in Eq. (36). 

If the sequence of the LP problems are all feasible, it means there is a zero-one vector In such that all the 
constraint in Eq. (36) are satisfied. This implies that the network N(V,E) is a 1 -identifiable expander and In 
contains the requisite minimum number of paths. However, if one of the LP problem in (37) is not feasible, it 
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does not necessarily mean that the ILP is infeasible. In other words, the network N(V, E) may be a 1 -identifiable 
expander while sequence of LP problems in Eq. (37) is not feasible. Figure 9 summarizes the relation between the 
feasible sets for the LP optimization (37) and the ILP problem (21), and 1 -identifiable networks. 

V. Evaluation Results 

In Section III we showed that if the routing matrix of a network is the bi-adjacency matrix for union of disjoint 
expander graphs, that network is 1 -identifiable. Moreover, we can estimate internal link delay using a LP optimizer 
in Eq. (21). However, a legitimate 'big-picture' question arises: how many networks actually satisfy conditions of 
Theorem 3; i.e., are they 1 -identifiable expanders? In this section, we generate random Internet-type networks to 
study this question. Our simulation results show that more than 70% of networks generated satisfy the conditions 
of Theorem 3 and therefore their link delay can be estimated using the l\ minimization introduced in Eq. (21). 

For those networks which are 1 -identifiable expander, i.e., their routing matrix satisfies the condition in Definition 
3 for k = 1, we determine average normalized estimation error when there is exactly k = 1 congested link 2 in the 
network. Then we explore the average normalized estimation error when there are k — 2,3 congested links inside 
the network - scenarios for which we do not have any analytical results to date. Surprisingly, even for k > 1, we 
find that the average normalized estimation error remains within an acceptable range. Finally, for those networks 
which are 1 -identifiable expander, we determine the gain (relative to probing all possible paths in the network) by 
determining the histogram for minimum number of paths required to estimate link delay for a given error tolerance. 

A. Generate Networks with Random Topology 

We use Inet version 3.0 [33], [34] - an Internet topology generator software (at AS 3 level) - to generate random 
graphs with given power law and fixed number of boundary nodes 4 . We create networks containing 5000 nodes 
with 5, 8, 10, 12, 16 and 20 boundary nodes, respectively. The output of Inet, which gives neighbors of each node 
in the generated graph, is fed to matgraph toolbox in MATLAB [35] for modification. We first create a routing 
matrix containing the shortest paths between any boundary node pairs in the network. Then we delete all nodes and 
links which do not contribute to any of the above paths, since it is clear that if a link is not covered by any end-to- 
end path, it is not identifiable. Then we remove all nodes with degree two, as it is a well-known fact in network 
tomography that nodes with degree two are not 1 -identifiable using end-to-end measurement techniques [36]. The 
remaining networks constitute our random set. In Figure 10, six examples of Internet-like random networks are 
depicted. 

2 A congested link is one with significantly elevated delay compared to the rest of the links in the network. 

3 Autonomous System 

4 Boundary nodes are nodes with degree one which act as injection point for probes in our problem. 
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(d) (e) (f) 

Fig. 10. Output of Inet after our modification in MATLAB with (a) 5 (b) 8 (c) 10 (d) 12 (e) 16 (f) 20 boundary nodes. Nodes with degree 1 
represents injection nodes 



B. Networks and Expansion Property 

For the routing matrices of these random networks, we first examine how many of them satisfy the fc-identifiability 
expansion conditions in Definition 3. For networks with fixed number of boundary nodes, fifty different topologies 
are created. Table II shows percentage of them which satisfy the fc-identifiability expansion property for k — 1, 2, 3. 
Based on this observation, we can conclude that more than 70% of any Internet-topology based network satisfies 
expansion conditions given in Theorem 3 and therefore our results are applicable to them. As can be seen, the 
number of networks satisfying the expander conditions decreases as k increases. However, our simulation results in 
the next section shows the unexpected result that even if the delay vector x is fc-sparse with k > 1, the estimation 
error by using the proposed approach in Eq. (21) remains in an acceptable range. 

To see how many networks satisfy 1 -identifiable expander conditions, we use the algorithm in Table I. For a 
given network N(V,E) with routing matrix R rxti , this algorithm simply decomposes the routing matrix R into 
sub-matrices H' r . Xn . such that number of ones in each column of sub-matrix R£,. xn . are equal and different from 
any other sub-matrix R^.. xn ., i ^ j. Then it checks if every sub-matrix RJ,. xn . satisfies condition in Theorem 6 
for e < 1/4. 

For k > 1, as mentioned before, there is no algorithm in the literature. Hence, we use exhaustive search to 
check if network N(V,E) satisfies fc-identifiable expander conditions using the Definitions 3 and 2, which has 
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TABLE I 



Algorithm : 1-identifiable networks: 

Given network N(V, E) with routing matrix R rX n-r fe is the k-th row of R. 
If N(V, E) is 1-identifiable expander eflag is 1, otherwise it is zero. 

eflag «— 1 
degs<- Efc=i r fe 
for i in degs 

index <— find(degs == i) 

R' <- R(:, index) 

A <- R" ■ R' 

for j=l to dim(A) 
Afjj) <- 

end 

A «— max(A) 
e «- A/(2i) 
if e > 1/4 

eflag <— 
end 
end 



TABLE II 

For networks with fixed number of boundary nodes what percentage are k-identifiable expanders for k = 1, 2, 3 





5 


8 


10 


12 


16 


20 


k = 1 


80% 


82% 


76% 


72% 


74% 


72% 


k = 2 


0% 


60% 


62% 


56% 


50% 


50% 


k = 3 


0% 


0% 


0% 


56% 


50% 


46% 



complexity of 0(\E\ 2k ) that increases exponentially with k. Fortunately, it is not necessary to check if a network 
is fc-identifiable expander for k > 1. Even if the delay vector x is k > 1 sparse, the LP optimization (Eq. (21)) 
based solution in Theorem 3 can still be used. Our results in Table II show that the resulting normalized estimation 
error as shown in Fig. 1 1 remains acceptable. 

As we discussed in Section III-B, the reverse of Theorem 3 is not true, i.e., there are 1 -identifiable networks 
which do not satisfy expansion property. However, our simulation results suggests that such networks are a small 
fraction of all networks with Internet topology. In other words, if a network is 1 -identifiable, with high probability 
(more than .7) it is 1 -identifiable expander; i.e. satisfies conditions in Theorem 3. 

C. Delay Estimation: Simulation Experiments 

Theorem 3 says that if routing matrix of a network satisfies 1 -identifiable expander conditions, then link delays 
in the network can be estimated using Eq. (21). To examine the accuracy of the proposed delay estimation method, 
for each network created in Section V-A, we calculated the average normalized estimation error for all links as 
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follows. A reference link is selected and assigned a delay of 10 ms. to denote that it is congested. All other links 
in the network are assumed to experience i.i.d exponentially distributed delays with average /j,, i.e., 

f l {t) = -exp{--) VI eE (39) 

where fi(t) is the delay for link I and [i G [0, 1] to denote that these links do not undergo congestion. 
We exploit the proposed LP optimization in (21) to estimate link delays. For the network, the normalized 
estimation error for each congested link inside the network is calculated as follows: 

norm, err = -L — — (40) 

II x h 

Figure 11 presents average normalized estimation error when there are k congested links inside the network for 
k = 2, 3 and LP optimization Eq. (21) is used to estimate the delay. As expected, for [i = (implying that the 
vector x is 1 -sparse), perfect recovery with zero error is possible, as shown by the corresponding plot in Fig. 11 
(b). Otherwise, the average normalized estimation error for different /i mimics the expected trend from Eq. (22). 

Although the result in Theorem 3 is derived based on k = 1, i.e. the vector delay x in Eq. (1) is 1-sparse, the 
average normalized estimation error remains in an acceptable range for k = 2 and k = 3 deficient links. In fact 
for k > 1, the average normalized estimation error has a decreasing trend and for large networks, it is almost the 
same as k — 1 case. As Table II shows, the probability of being a fc-identifiable expander is higher for a large 
network than a smaller one for k = 2, 3, supporting the decrease with k. In addition, for a large network with k > 1 
deficient links, the deficient links are far from each other with high probability. Hence the network can be broken 
into k smaller sub-networks networks, each with one deficient link each, and is the possible reason why for large 
networks, the average normalized estimation error for k > 1 is almost the same as k = 1 case. 
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D. Minimum Path Set 

As mentioned in Section EH, the goal of compressed sensing is to achieve maximum possible gain. In Section 
IV, based on our result on expander graphs, we proposed an optimization problem in Eq. (37) which gives the 
minimum number of paths (minimum r) that makes the network 1-identifiable. For networks with fixed number 
of boundary nodes which are 1-identifiable, Figure 12 shows a histogram of number of paths needed (relative to 
number of links in the network) in order to make the network 1-identifiable. After selecting paths which make the 
network 1-identifiable, the LP optimization proposed in Eq. (21) can be used to find the link delays. 

As can be seen, in all the histograms in Figure 12, most of the mass is concentrated around 0.6. That means, if 
the network has n links, with high probability r = 0.6n paths suffice to estimate link delay. Thus, the minimum 
path selection algorithm proposed in Section IV-B will save, with high probability, 40% percent of overhead probe 
traffic needed for link delay estimation compared with one where a probe is sent between all boundary pairs. 

A second aspect highlighted by Figure 12 is that there is almost no mass below 0.4. This means that for a network 
satisfying conditions in Theorem 3, i.e. being a 1-identifiable expander, — cannot be less than 0.4. Restated, to 
estimate links delay using proposed algorithm in Eq. (21) in a network with n links, the number of end-to-end 
measurements cannot be less than 0.4n. The underlying cause is that the routing matrix of a graph exhibits structure 
distinct from a random binary network. 
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VI. Conclusion 

This work presented a novel approach to estimate link delay in a network. Using the idea of binary compressed 
sensing, we developed an upper bound on the accuracy of delay recovery using an end-to-end probing method. 
Sending probes between nodes on the boundary of a network comes with the cost of increasing traffic inside 
the network. Thus, we showed how to design the routing matrix for a given network that minimizes the number 
of injected probes while preserving 1-identifiability. Future research directions include extending the analytical 
framework to networks which are k > 1 identifiable. 
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Proof of Lemma 1: 

Let A' be submatrix of A containing rows from N(S). Since \S\ = 1 and graph is left e?-regular, || A'wg ||i = || 
Awg || i= d || wg ||i. We have 



Each set of two nodes in the left part has at least 2(1 — e)d neighbor nodes at the right side (expansion definition). 
Since each node at the left has degree d, number of common nodes is at most 2ed. That means each column of A' 
(except the one corresponding to S) has at most 2ed number of ones, yielding, 



Appendix A 



Proofs of Theorems 



=|| A'w ||i = || A'wfif + A'wfife Id 



(41) 



> || A'w s ||i - || A'w s . ||i 



= d || w s ||i - || AW ||i . 




(42) 



Therefore 



0>d||w s ||i-2ed|| w S c ||i 



(43) 



which means 



ws |[i< 2e || w S c ||i . 



(44) 
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The above argument is valid if G is an expander graph. We now show that it implies e < 1/4. By definition of 
expander graph we have: 

N(S) > 2(1 - e)d (45) 

To satisfy the expansion property, N(S) needs to be strictly greater than d; i.e. N(S) > d + 1. That means 
2(1 — e)d must be at least d+1. Hence, we have the following upper bound on e 

d- 1 



e < 



2d 



(46) 



For an upper-bound that is independent of d, we find the infimum of right hand side and choose e to satisfy that 
case. Clearly we have 



-f ^ = 7 (47) 

4=2,3,4... 2d 4 



implying that e < j 5 . 



Theorem 1: 

Let vector w be in null space of A; i.e. w G A/"(A). Using Eq. (8) in lemma 1 we have: 

II w s ||i < 2e || w S c Hi 
|| w s ||i +2e || w s ||i < 2e |[ w S c ||i +2e || w s ||i 



w s ||i < 



2e 



l + 2e 



w ||i 



Now, let y = x' — x. Clearly y 6 Af(A) and we have: 



x ||i>|| x' ||i = 



> 



> 



(x + y) s ||i + || (x + y) S c ||i 
xs + ys ||l + || X S e + y S c ||i 

xs 111 - II ys 111 + II ys- ||i - II x S c ||i 
x ||i -2 || x s - 111 + II y ||i -2 || ys ||i 

4e 

x ||i -2 || x sc ||i +(1 - Y^fJ I' y I' 1 



Where in the last equality we use result in Eq. (48). Therefore we have: 



x'-x||H|y ||i</(e) || x s . ||i 



where /(e) = ^ 



(48) 



(49) 



(50) 



'note that we exclude case of d = 1, since each bipartite graph which is left 1-regular is an expander graph 
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Theorem 2: 

Let x' be the solution to optimization problem in Eq. (21). It means Rx' = Rx and || x' ||i<|| x ||i. On the 
other hand, G is a (2, d, e)-expnader graph with biadjacency matrix (R). Consequently Eq. (9) in Theorem 1 holds 
for x and x'. 

■ 

Theorem 3: 

We prove the theorem for the case in which G(X, Y, H) has only two expander subgraphs. The general case 
can be easily extended following the same way. Let G\{X 1 ,Y,H 1 ) with \X\\ = m, and 6*2(^2,^,^2) with 
\X 2 \ = n — to, be two ^-regular (di ^ cfe) subgraphs of G(X, Y, H) with bi-adjacency matrices Ri and R 2 , 
respectively. Without lost of generality, let rename the elements in X such that R = [Ri R 2 ]. 

Now suppose w e R" belong to null space of R; w = [w* w^]*. 

Let S be any set of k = 1 coordinates of w. Further, let R' be submatrix of R containing rows from N(S). We 
consider two following cases: 
Case 1: S C {1,2,..., to}: 

In this case S represents a node in Gi(X\, Y, Hi) which is a (2, d\, e) -expander by assumption. Similar to proof 
of Lemma 1, || R'wg || = || Rwg ||i= d\ || wj ||i. We have 

0=||R'w||i = || R'w s + R'w S c ||i (51) 
> || R'w s ||i - || R'w s = ||i 
= diw s - || R'w s <: ||i 

Since G\ is left di -regular R' has d\ rows. Using that, we can put an upper-bound on || R'wgc ||i as follows. 
Let r* be the i-th rows of R. Then 
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di 

||RW||i = ^|r*ws<=| (52) 

i=l 

di 1*1 

i=l j=l 
(1) di |X| 

j=i »=i 

1*1 d t 

j=i »=i 

1*1 di 

j=i »=i 

= E \ Wsc i I E ri i + E I Ws "3 1 51 ri J ' 

j=l i=l j=|Xi|+l i=l 

where for inequality (1), we used the triangular inequality and the fact that e {0, 1}. Since G\(X\,Y,H\) is 
an (2, di, e)-expander, each two nodes at the right hand side have at most 2edi neighbors at the right in common. 
That means, each column in R' has at most 2ed; i.e. J2iLi r ij < 2edi for each i — {1, 2, |-Xi|}. On the other 
hand since R' has d\ rows, J^iLi r ij -! ^i f° r eacri * — {l-^il + 1> 2, • \X\}. Using these facts and result in (52) 
we get: 

l*i| di 1*1 di 

iirwiii < X>s<yi$> y + E kjiE r « < 53 > 

j=l i=l j=|Xi|+l i=l 

l*i I 1*1 

< 2ed 1 J2\wsc j \+d 1 \ w S c j\ 

j'=i j'=I*i|+i 
= 2e || wi S c ||i +di j| w 2 || 

Substituting above inequality in (51) we have: 

> di || w s ||i -2edi || w 1S a \\ x -di || w 2 ||i (54) 
Therefore we have the following upper-bound 

II w s ||i< 2e || wis ||i + || w 2 ||i (55) 

which also can be written as below: 

II w s || i< || wi ||i +Y~fY £ II w 2 II ( 56 ) 

Case 2: 5" c {to + 1,2, ...,n}: 

By the same argument as Case 1, we have: 

0=||Rw||i = || R,'w ||i (57) 

> d 2 || w s ||i - || R'w Sc ||i 
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As in case 1, we can put an upper-bound on || R'wgc ||i as follows. 

I-Xll d,2 \X\ d 2 

||R'w S c||i < El w ^lE r ^+ E \ w S'3\Y,nj (58) 

j=l i=l j=|Xi|+l i=l 

1*1 1 1*1 

< |w s ^| +2ed 2 ^ |w s =j| 
j'=i j=|^i|+i 

< di || wi ||i +2ed 2 || w 2S <= ||i 

Using (57) and (58) we have the following upper-bound for || wj ||i. 

|| w s ||i< 2e || w 2S c ||i + || wi ||i (59) 

which also can be written as below: 

II w s ||i< || w 2 ||i +T ^- || wi || (60) 

Now suppose for a given y, two 1-sparse vectors u and v satisfies equality y = Ru = Rv. Let w = u v and 
u = [u£u 2 ]*, v = [v* v 2 ]* and w = [w^w|]*. Clearly, the following equalities hold: 

wi = Ui — v x (61) 
w 2 = u 2 - v 2 

Without loss of generality, let's assume || u ||i>|| v ||i. We consider two following cases: 
Case 1: S C {1,2, ...,m} 

II u ||i>|| v ||i = || ui + wi ||i + || u 2 +w 2 ||i (62) 

= || uis + wis 111 + II Ui S c + w iS c ||i + || u 2 + w 2 ||i 

> || uis ||i - || wis ||i + || wise ||i - || uis<= ||i + || w 2 ||i - || u 2 ||i 

= II uis 111 -(II ui 5 c ||i + || u 2 ||i) + (|| wise ||i + || w 2 ||i)- || wis 111 

Since S C {1,2,...,™}, we have || w 2 ||i + || wi S c || 1 = || w S c ||i and || u 2 ||i + || uis- ||i = || us- ||i. So Eq. 
(62) can be simplified as bellow: 

2 || u s - ||i > || w S c ||i - || wis Hi (63) 
= II w Hi -2 || wis 111 

by using Eq. (56) we have: 

1 — 2e ,, 1 - 2e „ 

2 || us- ||i> ^-j-^ || wi ||i ~Y^Y € II w 2 111 (64) 

By Eq. (61) and triangular inequality we have: 

II w 2 ||i<|| u 2 ||i + || v 2 ||i (65) 
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Applying above inequality to (64) we have: 
l + 2e 



2||ui S c lli+^f; II u 2 lli+^f II v 2 Hi 



>|| w ||i (66) 



1 - 2e 

Clearly || ugc ||i>|| Uigc d, || us<= ||i>|| u 2 ||i and || vgc ||i>|| v 2 ||i. Therefore, the following inequalities hold: 

3 + 2e „ „ „ „ 

1 _ 2e II u S c Id + || v 25 c ||i>|| w ||i (67) 

Now let j G S c . There is a path p* which goes through link j and not link in S (since it is a logical network). 
Let r p « be the corresponding row for p* in routing matrix R. Since Ru = Rv, we have: 

r p * u = r p » v > vj . (68) 

Since p* doesn't go through link S, its corresponding entry in r p » is zero. Hence we have || us<= ||i> r p »u. 
Therefore we can have the following upper-bound for every entry of Vj Vj £ S c 

II u S c W^vj Vj e S c (69) 

By adding up both side of the inequality for all j G S c we have: 

|S C | II u S c H^ll v sc ||i . (70) 

Clearly \X\ > \S C \. Therefore, the following upper-bound is valid for w = u — v: 

( r ± i + |x|)l|U5c n^i^ii 1 - (71) 

Case 2: S C {m + 1, 2, n}. By the same argument as case 1 we have: 

+ \X\) || u S c ||i>||w||i. (72) 

Note that set X in G(X, Y, if) is the same as E in network N(V, E). The rest of the proof for LP optimization 
is the same as Theorem 2. 



Proof of Corollary 4: 

N(V,E) with routing matrix R is a 1 -identifiable expander. That means bipartite graph G(X,Y,H) with 
biadjacency matrix R is union of left c^-regular bipartite graphs G(Xi,Y,Hi) such that: 

. X = ux, 

• di ^ dj for i 7^ j 

. G(Xi,Y, Hi) is a (2, d,, e)-expander with e<\ 

Now let consider two links Zj and Zj. If deg(li) ^ deg(lj) clearly one of the first two options in corollary 4 would 
be true. If deg(li) — deg(lj) — di they belong to the same subgraph, say, G{X i ,Y,H i ). By the last condition of 
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1 -identifiable network conditions, G(Xi, Y, Hi) is a (2, d, e)-expander. Now let $ = k, lj. By definition of expander 
graph in Definition 2 The following holds for $: 

|iV($)| > (l-e)d\$\ =2(1 -e)d (73) 

By Theorem 2 maximum value possible for e is \. Therefore minimum value for the right hand side of above 
inequality would be achieved if e = \ and it would be: 

\N{$)\ > ^d (74) 

d(li, lj) is defined to be number of nodes connected to both U and lj. Using that we can calculate total number of 
nodes connected to at least one of k and lj as follows: 

\N{<t>)\=d(l i )+d(l j )-d{k,lj)=2d-d(l i ,l j ) (75) 

Substituting above equality in Eq. (74) results in 

d-2d(k,lj) > (76) 

which can also be written as follows: 

2d - 2d(k, lj) = d(k) + d{lj) - Ad{k, lj)>0 (77) 
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